vig

Under the Semantic Web view of OBDA, OBDA systems take as input an OWL 2 QL ontology (TBox), a database instance over some database schema, and a mapping file that relates queries over the database to assertions in the ontology (ABox).

Benchmarks for OBDA systems should ideally provide the means to perform scalability analyses with respect to each of these input components. The NPD benchmark allows for scalability analyses with respect to the size of the data component by using VIG to scale a data instance built from the real-world data available in the NPD FactPages.

Although VIG is currently used with the NPD benchmark, it is not specific to that setting. To generate data with VIG, it suffices to provide a source data instance, stored in a MySQL database, and (optionally) a mappings file. The MySQL requirement is not a real limitation: conversion tools between RDBMSs are generally available, and the only schema information VIG uses is the primary and foreign key constraints (i.e., triggers and stored procedures do not need to be translated). VIG produces data as CSV files, which can then be imported into any RDBMS. Before including VIG in your benchmark, please check whether its generation strategy and the similarity measures it guarantees are suitable for your scenario by referring to the wiki page [[Characteristics of The Data Produced By VIG]]. Briefly, your decision should be based on the following criteria: